Proc. Natl Acad. Sci. USA 

Vol 74, No. 11, pp. 5088-5090, November 1977 

Evolution 

Phylogenetic structure of the prokaryotic domain: The primary 
kingdoms 

(archaebacteria/eubacteria/urkaryote/16S ribosomal RN A/molecular phylogeny) 

Carl R. Woese and George E. Fox* 

Department of Genetics and Development, University of Illinois, Urbona, Illinois 61801 
Communicated by T. M. Sonneborn, August 18, 1977 



ABSTRACT A phylogenetic analysis based upon ribosomal 
RNA sequence characterization reveals that living systems 
represent one of three aboriginal lines of descent: (i) the eu- 
bacteria, comprising aD typical bacteria; (ii) the archaeDacteria, 
containing methanogenic bacteria; and (Hi) die urkaryotes, now 
represented in the cytoplasmic component of eukaryotic 

The biologist has customarily structured his world in terms of 
certain basic dichotomies. Classically, what was not plant was 
animal. The discovery that bacteria, which initially had been 
considered plants, resembled both plants and animals less than 
plants and animals resembled one another led to a reformula- 
tion of the issue in terms of a yet more basic dichotomy, that of 
eukaryote versus prokaryote. The striking differences between 
eukaryotic and prokaryotic cells have now been documented 
in endless molecular detail As a result, it is generally taken for 
granted that all extant life must be of these two basic types. 

Thus, it appears that the biologist has solved the problem of 
the primary phylogenetic groupings. However, this is not the 
case. Dividing the living world into Prokaryotae and Eukar- 
yotae has served, if anything, to obscure the problem of what 
extant groupings represent the various primeval branches from 
the common line of descent. The reason is that eukaryote/ 
prokaryote is not primarily a phylogenetic distinction, although 
it is generally treated so. The eukaryotic cell is organized in a 
different and more complex way than is the prokaryote; this 
probably reflects the former's composite origin as a symbiotic 
collection of various simpler organisms (1-5). However striking, 
these organizational dissimilarities do not guarantee that eu- 
karyote and prokaryote represent phylogenetic extremes. 

The eukaryotic cell per se cannot be directly compared to 
the prokaryote. The composite nature of the eukaryotic cell 
makes it necessary that it first be conceptually reduced to its 
phylogenetically separate components, which arose from an- 
cestors that were noncomposite and so individually are com- 
parable to prokaryotes. In other words, the question of the 
primary phylogenetic groupings must be formulated solely in 
terms of relationships among "prokaryotes" — i.e., noncomposite 
entities. (Note that in this context there is no suggestion a priori 
that the living world is structured in a dichotomous way.) 

The organizational differences between prokaryote and 
eukaryote and the composite nature of the latter indicate an 
important property of the evolutionary process: Evolution seems 
to progress in a "quantized" fashion. One level or domain of 
organization gives rise ultimately to a higher (more complex) 
one. What "prokaryote" and "eukaryote" actually represent 
are two such domains. Thus, although it is useful to define 
phylogenetic patterns within each domain, it is not meaningful 
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to construct phylogenetic classifications between domains: 
Prokaryotic kingdoms are not comparable to eukaryotic ones. 
This should be recognized by an appropriate terminology. The 
highest phylogenetic unit in the prokaryotic domain we think 
should be called an "urkingdom" — or perhaps "primary 
kingdom." This would recognize the qualitative distinction 
between prokaryotic and eukaryotic kingdoms and emphasize 
that the former have primary evolutionary status. 

The passage from one domain to a higher one then becomes 
a central problem. Initially one would like to know whether this 
is a frequent or a rare (unique) evolutionary event. It is tradi- 
tionally assumed — without evidence — that the eukaryotic 
domain has arisen but once; all extant eukaryotes stem from a 
common ancestor, itself eukaryotic (2). A similar prejudice holds 
for the prokaryotic domain (2). [We elsewhere argue (6) that 
a hypothetical domain of lower complexity, that of "pro- 
genotes," may have preceded and given rise to the prokaryotes. ) 
The present communication is a discussion of recent findings 
that relate to the urkingdom structure of the prokaryotic do- 
main and the question of its unique as opposed to multiple or- 
igin. 

Phylogenetic relationships cannot be reliably established in 
terms of noncomparable properties (7). A comparative ap- 
proach that can measure degree of difference in comparable 
structures is required An organism's genome seems to be the 
ultimate record of its evolutionary history (8). Thus, compar- 
ative analysis of molecular sequences has become a powerful 
approach to determining evolutionary relationships (9, 10). 

To determine relationships covering the entire spectrum of 
extant living systems, one optimally needs a molecule of ap- 
propriately broad distribution. None of the readily character- 
ized proteins fits this requirement. However, ribosomal RNA 
does. It is a component of all self-replicating systems; it is readily 
isolated; and its sequence changes but slowly with time— per- 
mitting the detection of relatedness among very distant species 
(11-13). To date, the primary structure of the 16S (18S) ribo- 
somal RNA has been characterized in a moderately large and 
varied collection of organisms and organelles, and the general 
phylogenetic structure of the prokaryotic domain is beginning 
to emerge. 

A comparative analysis of these data, summarized in Table 
1, shows that the organisms clearly cluster into several primary 
kingdoms. Hie first of these contains all of the typical bacteria 
so far characterized, including the genera Acetobacterium, 
Acinetobacter, Acholeplasma, Aeromonas, Alcaligenes, An- 
acystis, Aphanocapsa, Bacillus, Bfellovibrio, Chlorobium, 
Chromatium, Clostridium, Corynebacterium, Escherichia, 
Eubacterium, Lactobacillus, Leptospira, Micrococcus, My- 
coplasma, Paracoccus, PhotobacteHum, Propionibacterium, 
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Table 1. Association coefgrientg (Sab ) between representative members of the three primary kingdoms 
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4. Escherichia coli 


0.05 


0.10 


0.06 




0.24 


0.25 


0.28 


0.26 


0.21 


0.11 


0.12 


0.07 


0.12 


5. Chlorobium vibrioforme 


0.06 


0.05 


0.06 


0.24 




0.22 


0.22 


0.20 


0.19 


0.06 


0.07 


0.06 


0.09 


6. Bacillus firmus 


0.08 


0.06 


0.07 
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0.22 




0.34 


0.26 
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0.12 


7. Corynebacterium diphtheriae 


0.09 


0.10 


0.07 


0.28 


0.22 


0.34 




0.23 
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8. Aphanocapsa 6714 


0.11 
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0.09 
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0.31 
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9. Chloroplast (Lemna) 


0.08 
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0.21 
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10. Methanobacterium thermoautotrophicum 
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0.11 


0.14 
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11. Af. ruminantium strain M-l 
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12. Methanobacterium sp., Cariaco isolate JR-1 
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0.13 
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13. Methanosarcino barkeri 


0.08 


0.07 


0.07 


0.12 
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0.30 


0.24 
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The 16S (18S) ribosomal RNA from the organisms (organelles) listed were digested with Tl RNase and the resulting digests were subjected 
to two-dimensional electrophoretic separation to produce an oligonucleotide fingerprint The individual oligonucleotides on each fingerprint 
were then sequenced by established procedures (13, 14) to produce an oligonucleotide catalog characteristic of the given organism (3, 4, 13-17* 
22, 23; unpublished data). Comparisons of all possible pairs of such catalogs defines a set of association coefficients (Sab) given by: Sab - 
%Nab/(Na + Nb), in which N A , Nb, and Nab are the total numbers of nucleotides in sequences of hezamers or larger in the catalog for organism 
A, in that for organism B, and in the inteifeaction of the two catalogs, respectively (13, 23). 



Pseudomonas, Rhodopseudomonas, Rhodospirillum, Spiro- 
chaeta, Spiroplasma, Streptococcus, and Vibrio (refs. 13-17; 
unpublished data). The group has three major subdivisions, the 
blue-green bacteria and chloroplasts, the "Cram-positive" 
bacteria, and a broad "Gram-negative" subdivision (refs. 3, 4, 
13-17; unpublished data). It is appropriate to call this urking- 
dom the eubacteria. 

A second group is defined by the 18S rRN As of the eukaryotic 
cytoplasm — animal, plant, fungal, and shine mold (unpublished 
data). It is uncertain what ancestral organism in the symbiosis 
that produced the eukaryotic cell this RNA represents. If there 
had been an "engulfing species" (1) in relation to which all the 
other organisms were endosymbionts, then it seems likely that 
18S rRNA represents that species. This hypothetical group of 
organisms, in one sense the major ancestors of eukaryotic cells, 
might appropriately be called urkaryotes. Detailed study of 
anaerobic amoebae and the like (18), which seem not to contain 
mitochondria and in general are cytologically simpler than 
customary examples of eukaryotes, might help to resolve this 
question. 

Eubacteria and urkaryotes correspond approximately to the 
conventional categories "prokaryote" and "eukaryote" when 
they are used in a phylogenetic sense. However, they do not 
constitute a dichotomy; they do not collectively exhaust the class 
of living systems. There exists a third kingdom which, to date, 
is represented solely by the methanogenic bacteria, a relatively 
unknown class of anaerobes that possess a unique metabolism 
based on the reduction of carbon dioxide to methane (19-21). 
These "bacteria" appear to be no more related to typical 
bacteria than they are to eukaryotic cytoplasms. Although the 
two divisions of this kingdom appear as remote from one an- 
other as blue-green algae are from other eubacteria, they 
nevertheless correspond to the same biochemical phenotype. 
The apparent antiquity of the methanogenic phenotype plus 
the fact that it seems well suited to the type of environment 
presumed to exist on earth 3-4 billion years ago lead us tenta- 
tively to name this urkingdom the archaebacteria. Whether or 
not other biochemically distinct phenotypes exist in this king- 
dom is clearly an important question upon which may turn our 
concept of the nature and ancestry of the first prokaryotes. 
Table 1 shows the three urkingdoms to be equidistant from 



one another. Because the distances measured are actually 
proportional to numbers of mutations and not necessarily to 
time, it cannot be proven that the three lines of descent 
branched from the common ancestral line at about the same 
time. One of the three may represent a far earlier bifurcation 
than the other two, making there in effect only two urkingdoms. 
Of the three possible unequal branching patterns the case for 
which the initial bifurcation defines urkaryotes vs. all bacteria 
requires further comment because, as we have seen, there is a 
predilection to accept such a dichotomy. 

The phenotype of the methanogens, although ostensibly 
"bacterial," on close scrutiny gives no indication of a specific 
phylogenetic resemblance to the eubacteria. For example, 
methanogens do have cell walls, but these do not contain pep- 
tidoglycan (24). The biochemistry of methane formation ap- 
pears to involve totally unique coenzymes (23, 25, 26). The 
methanogen rRN As are comparable in size to their eubacterial 
counterparts, but resemble the latter specifically in neither 
sequence (Table 1) nor in their pattern of base modification 
(23). The tRNAs from eubacteria and eukaryotes are charac- 
terized by a common modified sequence, T^CG; methanogens 
modify this tRNA sequence in a quite different and unique way 
(23). It must be recognized that very little is known of the 
general biochemistry of the methanogens — and almost nothing 
is known regarding their molecular biology. Hence, although 
the above points are few in number, they represent most of 
what is now known. There is no reason at present to consider 
methanogens as any closer to eubacteria than to the "cyto- 
plasmic component" of the eukaryote. Both in terms of rRNA 
sequence measurement and in terms of general phenotypic 
differences, then, the three groupings appear to be distinct 
urkingdoms. 

If a third urkingdom exists, does this suggest that many more 
such will be found among yet to be characterized organisms? 
We think not, although the matter clearly requires an exhaus- 
tive search. As seen above, the number of species that can be 
classified as eubacteria is moderately large. To this list can be 
added Spirillum and Desulfovibrio, whose rRNAs appear 
typically eubacterial by nucleic acid hybridization measure- 
ments (27). Because the list is also phenotypically diverse, it 
seems unlikely that many, if any, of the yet un characterized 
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prokaryotic groups will be shown to have coequal status with 
the present three. Conceivably the halophiles whose cell walls 
contain no peptidoglycan, are candidates for this distinction 
(28,29). 

Eukaryotic organelles, however, could be a different matter. 
There can be no doubt that the chloroplast is of specific eu- 
bacterial origin (3, 4). A question arises with the remaining 
organelles and structures. Mitochondria, for example, do not 
conform well to a "typically prokaryotic** phenotype, which 
has led some to conclude that they could not have arisen as 
endosymbionts (30). By using "prokaryote" in a phylogenetic 
sense, this formulation of the issue does not recognize a third 
alternative — that the organelle in question arose endosym- 
biotically from a separate line of descent whose phenotype is 
not "typically prokaryotic" (i,e., eubacterial). It is thus con- 
ceivable that some endosymbiotically formed structures rep- 
resent still other major phylogenetic groups; some could even 
be the only extant representation thereof. 

The question that remains to be answered is whether the 
common ancestor of all three major lines of descent was itself 
a prokaryote. If not, each ur kingdom represents an independent 
evolution of the prokaryotic level of organization. Obviously, 
much more needs to be known about the general properties of 
all the urkingddms before this matter can be definitely settled. 
At present we can point to two arguments suggesting that each 
urkingdom does represent a separate evolution of the prokar- 
yotic level of organization. 

The first argument concerns the stability of the general 
phenotypes. The general eubacterial phenotype has been stable 
for at least 3 billion years — La, the apparent age of blue-green 
algae (31). The methanogenic phenotype seems to be at least 
this old in that branchings within the two urkingdoms are 
comparably deep (see Table 1). The time available to form each 
phenotype (from their common ancestor) is then short by 
comparison, which seems paradoxical in that the two pheno- 
types are so fundamentally different. We think that this os- 
tensible paradox implies that the common ancestor in this case 
was not a prokaryote. It was a far simpler entity; it probably did 
not evolve at the "slow" rate characteristic of prokaryotes; it 
did not possess many of the features possessed by prokaryotes, 
and so these evolved independently and differently in separate 
lines of descent. 

The second argument concerns the quality of the differences 
in the three general phenotypes. It seems highly unlikely, for 
example, that differences in general patterns of base modifi- 
cation in rRNAs and tRNAs are related to the niches that or- 
ganisms occupy. Rather, differences of this nature imply in- 
dependent evolution of the properties in question. It has been 
argued elsewhere that features such as RN A base modification 
generally represent the final stage in the evolution of translation 
(32). If these features have evolved separately in two lines of 
descent, their common ancestor, lacking them, had a more 
rudimentary version of the translation mechanism and conse- 
quently, could not have been as complex as a prokaryote (6). 

With the identification and characterization of the urking- 
doms we are for the first time beginning to see the overall 
phylogenetic structure of the living world. It is not structured 
in a bipartite way along the lines of the organizationally dis- 
similar prokaryote and eukaryote. Rather, it is (at least) 
tripartite, comprising ({) the typical bacteria, («) the line of 
descent manifested in eukaryotic cytoplasms, and (Hi) a little 



explored grouping, represented so far only by methanogenic 
bacteria. 
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